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ABSTRACT 


V  A  computational  algorithm  for  the  determination  of 
a  piece-wise  linear  approximation  to  an  arbitrarily  specified 
function  of  one  variable  is  described.  In  particular  the 
algorithm  generates  the  optima  piece-wise  linear  approximation 
consistent  with  a  specified  accuracy  in  the  sense  that  the 
number  of  segments  is  minimized.  It  is  demonstrated  that  in 
contrast  to  the  problem  of  minimizing  the  maximum  error  with 
a  specified  number  of  segments,  this  formulation  leads  to  a 
computation  based  only  on  local  values  of  the  given  function 
and  a  correspondingly  efficient  computational  procedure. 
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i'iece-Wise  Linear  Approximations 


The  representation  of  arbitrarily  given  curves  in  terms 
of  piece-wise  linear  approximations  is  a  technique  of  importance 
not  only  in  many  digital  computer  applications,  but  is  widely 
employed  in  analog  computer  embodiments  for  function  represen¬ 
tation.  For  many  applications  in  the  digital  realm  it  is  a 
strong  competitor  to  rational  function  approximation  techniques 
both  with  respect  to  the  compactness  of  the  representation  and 
the  speed  with  which  the  approximation  may  be  evaluated.  Of 
particular  interest  on  both  counts,  speed  and  compactness,  are 
approximations  comprised  of  the  minimum  number  of  linear  segments 
consistent  with  the  level  of  accuracy  desired.  While  the  problem 
of  discovering  a  minimum  segment  approximation  to  an  arbitrarily 
specified  function  has  been  previously  treated  by  mathematical 
programming  techniques  (see  for  example  Gluss  1 1.  j  where  it  is 
formulated  as  a  dynamic  program) ,  the  procedure  described  herein 
treats  the  problem  rather  differently  and  leads  to  an  efficient 
computational  algorithm. 

The  particular  problem  which  is  considered  in  this 
paper  is  that  of  approximating  a  single  valued  function  of  one 
variable  in  terms  of  a  sequence  of  linear  segments.  More 
formally  we  assume  a  given  function  f(x)  defined  on  the  interval 
[a,bj,  and  we  seek  an  approximating  function  g(x)  such  that 
|f(x)-g(x)js  for  a<x<b.  The  function  g(x)  is  to  consist  of  a 
sequence  of  linear  pieces  so  that  for  every  x  in  ja,b],  g(x)  is 
of  the  form  g(x)=a+f3x.  More  particularly  values  for  i=0,l,2, 
...,n  are  to  be  selected  having  the  properties  that  a=Xg, 
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xi<xi+l’  and  xn~b’  s0  that  for  xi-l~x<xi’  g(x)=9i(x)=ai+pix.  Of 
all  approximating  functions  g(x)  which  meet  the  above  conditions 

we  are  interested  in  selecting  one  which  is  comprised  of  the 
minimum  possible  number  of  linear  segments.  "Two  cases  are  to 
be  considered;  (1)  the  function  g(x)  may  be  discontinuous  at  the 
boundaries  between  the  linear  segments,  (2)  the  function  g(x) 
must  be  continuous  everywhere  on  [a,b]. 

Consider  first  the  case  in  which  discontinuities  are 
permitted  at  the  linear  segment  boundaries.  An  example  of  a 
three  segment  approximation  of  this  variety  is  given  in  Figure  1. 
In  this  case  the  algorithm  may  begin  by  selecting  the  longest 
single  linear  segment  which  satisfies  the  error  restriction 
everywhere  from  a  to  the  right -most  boundary  of  the  initial 
segment,  i.e.,  by  finding  a^,^,  and  such  that  |  (o^  +  f^x) 
-f(x)j<b  for  a<x<x^  and  moreover  is  as  large  as  possible.  It 
is  clear  that  the  segment  g^(x)=(a^  +  for  a  <x<,x^  may  be 

taken  as  a  component  of  the  optimal  approximation  since  any 
other  choice  of  a  left-most  segment  would  result  in  a  "remaining 
function  to  be  approximated"  which  includes  the  interval  [Xj,b|. 
It  is  obviously  not  possible  that  the  number  of  linear  segments 
required  to  approximate  f(x)  over  [f;,b]  is  less  than  the  number 
required  over  [x^,b],  where  £<'x^,  since  any  M  segment  approxi¬ 
mation  of  f(x)  over  [£,b]  is  itself  an  approx iroair. ion  over  the 
subinterval  [x^,b|  consisting  of  at  roost  M  segments. 

The  algorithm  proceeds  by  repeated  selection  of  the 
linear  segment  of  maximum  scope  from  the  left-hand  end  of  the 
"remaining  function  to  be  approximated". 


Since  we  are  primarily  concerned  with  the  determination 
of  approximations  to  arbitrarily  given  functions  by  means  of 
a  digital  computer,  we  will  deal  with  a  slightly  different 
problem  in  which  we  assume  that  the  function  £(x)  is  defined  only 
at  a  discrete  set  of  points  which  we  take  to  be  the  non-negative 
integers.  Thus  we  restate  the  problem  of  selecting  the  initial 
linear  segment  as  fol. lot's.  Given  real  numbers  6,f(0),f(l), 
f(2),...,  find  real  numbers  a,  |i,  and  n  such  that  j  f  ( j )  —  <  o  *  p  j )  | 
for  0<j<n  and  n  is  maximal. 

Note  that  for  each  value  of  j  in  0<j<n  it  follows  that 
aach  of  the  following  relations  must  hold: 

a  +  pj<f(j)  +  * ,  and 
a  +  pj>f<j)  - 

Note  that  each  of  the  above  relations  may  be  regarded  as  defining 
a  half-space  in  the  two  dimensional  space  whose  coordinate  axis 
are  a  and  p.  Thus  each  value  of  j  may  be  associated  with  two 
half-spaces  the  intersection  of  which  defines  a  strip  in  the 
a,p  plane.  This  strip  includes  the  parameters  of  all  of  the 
linear  segments  which  meet  the  error  conditions  for  that 
particular  value  of  j.  Clearly  values  of  a  and  p  exist  satisfy¬ 
ing  |f(j)  -  (a+(3j)|<^  for  all  0<j<n,  just  in  case  the  intersection 
of  all  the  associated  half  spaces  f.v  not  empty.  This  observation 
suggests  the  computational  algorithm.  We  begin  with  j=0  and 
determine  the  equations  of  the  corresponding  half  planes.  We 
next  set  j=l  and  find  the  "corners”  of  the  convex  polygon  which 
is  the  intersection  of  the  4  half  planes.  Note  that  at  this 
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point  the  convex  polygon  will  be  in  fact  a  parallelogram.  We 
next  increment  j  by  one  and  determine  the  resulting  cumulative 
intersection  which  will  be  either  null  or  a  convex  polygon.  We 
proceed  in  this  manner  until  we  have  found  the  largest  value  of 
j  such  that,  the  cumulative  intersection  is  not  empty.  Having 
found  the  largest  such  value  of  j  we  may  take  as  the  parameters 
of  the  initial  linear  segment  any  value  of  a  and  J3  within  the 
corresponding  polygon. 

While  the  preceding  discussion  outlines  an  algorithm 
which  is  of  interest  in  the  determination  of  piece-wise  linear 
approximations  in  the  case  where  discontinuities  are  permitted, 
a  relatively  minor  variant  of  that  procedure  handles  the  more 
interesting  case  in  which  g(x)  is  continuous.  While  the 
principle  reason  for  our  interest  in  the  continuous  case  is 
that  discontinuities  frequently  are  unacceptable  because  of 
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the  way  in  which  the  approximation  is  employed  in  subsequent 
computations,  the  continuous  approximation  has  the  interesting 
side  benefit  that  it  leads  to  a  more  compact  representation. 

This  is  due  to  the  fact  that  although  the  number  of  linear 
segments  in  a  continuous  approximation  may  be  somewhat  greater 
than  in  the  discontinuous  case  each. segment  may  be  defined  by 
two  rather  than  three  parameters.  To  illustrate  the  procedure 
for  a  continuous  approximation  we  return  to  the  consideration 
of  a  continuous  f(x)  and  investigate  some  of  the  properties  of 
the  linear  segment  such  that  |  (o  +  Jix)  -  f(x)|<ft  for  a<x<x^  for 
the  largest  possible  x^.  First,  it  is  clear  that  jf (Xj ) - (n +px^ ) 
-b  since,  if  this  were  not  the  case,  it  would  be  possible  to 
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to  increase  x^.  Let  us  assume  that  n+px^=f (x^) +6 .  A  theorem 
due  to  Tchebysheff  tells  us  that  there  must  be  at  least  3  points 
at  which  the  error  takes  on  its  maximum  value  and  that  the  sign 
of  the  error  at  these  points  alternates  as  we  move  from  one 
point  to  the  next.  Thus  there  is  a  right-most  point  Xq^;.^  at 
which  or+pxQ=f  (xQ)  -6 .  We  show  in  Appendix  C  that  the  point  xQ 
has  the  interesting  property  that  there  is  in  fact  no  point 
z>xQ  such  that  a+pz=f(z)-6  and  for  all  a£x<z,  jf  (x)  -  (rr+px)  |<*>. 

Suppose  that  by  some  process  we  have  determined  the 
parameters  of  the  left-most  line  segment,  that  it  remains  within 
the  error  bound  for  a<x<y^,  and  at  y^  we  have  a+{3y^=f  (y^) . 
Suppose  further  that  yQ  is  the  righti-most  point  in  the  interval 
(a,  ’/^)  at  which  the  linear  segment  touches  the  lower  error 
boundary,  i.e.,  a+PyQ=f (yg) -b -  We  now  observe  that  the  re¬ 
maining  problem  is  that  of  finding  a  set  of  connected  line 
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segments,  minimum  in  number,  such  that  for  yQ<x<b,  g  (x)  (x) -*> , 

and  for  y^<x<b,  g (x) <f (x) +* .  Any  set  of  connected  line  segments 
which  meets  the  above  condition  will  contain  a  line  which 
intersects  the  line  ir+p::)  along  the  segment  joining  the  point 
(yQ,  f(yQ)-M  to  the  point  (y^,  f(y^)+M.  Thus  if  we  define 
this  point  of  intersection  as  the  right-most  boundary  of  the 
in  It  1-1  cogi..ont  are  ascuref  of  continuity  it  the  hour,  airy  and 
->ver  the  segment.  We  next  note  that  the  difficulty  (measured 
in  terms  of  the  number  of  segments  required)  of  the  remaining 
approximation  problem  (after  the  first  segment  has  been  selected) 
can  only  be  reduced  by  moving  either  or  both  of  the  points  yQ 
and  y^  to  the  right,  since  this  diminishes  the  range  over  which 
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the  approximation  is  constrained  by  the  upper  and/or  lower 
error  boundaries.  Since  the  result  of  Appendix  C  shows  that 
both  Yq  and  are  maximized  by  the  saiue  linear  segment  we 
cannot  do  better  than  to  use  that  segment. 

We  note  that  after  the  first  linear  segment  has  been 
estab] ished  the  remaining  approximation  problem  differs  from 
the  original  problem  in  that  the  upper  and  lower  error 
boundaries  do  not  begin  at  the  same  point.  This  does  not 
complicate  the  problem  in  any  essential  way,  however,  and  may 
be  easily  tak4n  into  account  by  selecting  the  appropriate  half¬ 
spaces  to  enter  into  the  computational  algorithm. 

.\n  intuitively  satisfying  characterization  of  the 
procedure  is  illustrated  in  Figure  2  in  which  the  upper  and 
lower  boundaries  are  regarded  as  defining  a  two  dimension 
tube  into  which  a  straight  stick  is  pushed  from  the  left. 

Figure  2a  illustrates  the  terminal  position  of  the  stick.  This 
position  of  the  stick  defines  the  first  line  in  the  approxi¬ 
mating  sequence.  That  part  of  the  upper-  boundary  to  the  left 
of  point  b  and  that  to  the  left  of  point  a  on  the  lower 
boundary  are  then  cut  away  and  the  process  is  repeated.  This 
leads  to  the  terminal  stick  position  of  Figure  2b  which  defines 
the  second  line  in  the  sequence.  The  articulation  point 
between  the  first  two  sections  is  obviously  their  point  of 
intersection.  A  computational  algorithm  treating  the  discrete 
case  is  described  in  Appendix  A.  An  efficient  procedure  for 
handling  the  intersection  of  a  half-space  and  a  convex  polygon 
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is  given  in  Appendix  B,  while  Appendix  C  presents  the  theorem 
on  the  basis  of  which  the  optimality  proof  rests. 

While  the  preceding  discussion  has  been  in  terms  of 
a  uniform  band  of  tolerable  error  it  should  be  apparent  that 
the  uniformity  of  the  error  is  in  no  way  essential  to  the 
method  and  in  fact  the  computational  algorithm  outlined  in 
Appendix  A  is  given  in  terms  of  an  arbitrary  "upper"  and  "lower" 
curves.  While  these  curves  may  be  defined  as  (f(x)+&)  and 
(f  respectively  they  may  also  be  established  by  an  error 

condition  which  depends  on  either  the  independent  or  dependent 
variable.  Thus  there  may  be  regions  of  x  in  which  the  error 
is  not  particularly  important  or  the  tolerable  error  may  be 
more  reasonable  specified  as  a  given  percentage  of  the 
independent  variable. 

It  is  also  possible  to  employ  a  variant  of  this 
technique  to  represent  general  paths  in  2  or  more  dimensions 
where  the  path  is  specified  by  the  coordinates  of  an  ordered 
set  of  points  along  the  curve.  This  case  may  be  handled  by 
using  the  algorithm  of  Appendix  A  to  obtain  approximations 
to  the  parametric  functions  which  describe  the  variation  of 
each  of  the  coordinates  as  functions  of  the  index  variable 
over  the  original  point  set.  The  index  variable  may  then  be 
eliminated  between  the  parametric  approximations  and  a 
piece- wise  linear  approximation  to  the  general  path  in  2  or 
more  dimensions  may  be  obtained. 

(1)  B.  Gluss,  "A  Line-segment  curve-fitting  Algorithm"  In¬ 
formation  and  Control,  Vol.  5,  No.  3,  Sept.  1S62 
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Appendix  A 


An  Algorithm  for  the  Determination 
of  Piece-Wise  Linear  Approximations 

The  problem  with  which  this  algorithm  deals  is 
illustrated  in  Fig.  3.  C1(M)  is  ordered  set  of  points  defining 

the  "bottom"  curve  and  C2(M)  is  an  ordered  set  of  points  de¬ 
fining  the  top  curve.  M  is  employed  as  an  index  variable  over 
both  Cl  and  C2.  What  is  required  is  a  set  of  lines  which  de¬ 
fine  a  chain  of  connected  line  segments  such  that  all  of  the 
points  on  Cl  are  "under"  the  chain  and  all  C2  points  are  "over" 
the  chain.  The  chain  is  defined  by  a  sequence  of  lines 
LL(I) , 1=1 , 2 , . . .  where  the  first  segment  is  that  part  of  LL(1) 
between  its  intersection  with  the  vertical  line  through  M=0, 
and  its  intersection  with  LL(2),  the  next  segment  in  the  chain 
is  that  part  of  LL(2)  between  its  intersections  with  LL(1)  and 
LL ( 3) ,  etc. 

The  computational  algorithm  is  given  in  flow  diagram 
form  in  Fig.  4.  With  respect  to  Fig.  4  the  following  definitions 
are  in  order. 

Each  of  the  lines  in  Fig.  3  may  be  defined  in  terms 
of  a  y  intercept  and  a  slope,  e.g.  LL(1)  may  be  defined  by  the 
relation  y=a^+P^F*.  Thus  we  may  define  a  two  dimensional  space 
with  axes  a  and  {3,  each  point  of  which  corresponds  to  a  line  in 
Fig.  3.  The  notation  POLY  X  is  intended  to  refer  to  the 
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POINTS  ON  C2  ARE  DESIGNATED-0 


FIGURE  3  -  THE  DISCRETE  CASE 


FIGURE  4  -  FLOW  DIAGRAM  FOR  APPROXIMATION  ALGORITHM 


interior  and  the  boundary  of  a  convex  polygon  in  or>p  space. 

POLY  INIT  is  intended  to  denote  a  very  large,  essentially 
infinite,  convex  polygon  which  contains  in  its  interior  the 
points  which  correspond  to  any  line  which  could  conceivably 
be  a  member  of  the  approximating  sequence.  The  requirement 
that  M£  C2(M)  for  each  particular  value  of  M  r  ay  be 

regarded  as  defining  a  half-space  in  the  a,P  plane,  i.e. 
the  half-space  in  which  the  relation  holds,  and  this  half- 
space  is  denoted  by  S(C2(M)).  The  substitution  operation 
POLY  A+-POLY  B  ns(C2(M))  means  that  POLY  A  is  replaced  with 
the  polygon  which  results  from  the  formal  intersection 
of  POLY  B  and  the  half  space  S(C2(M)).  The  variable  LFTC1 
is  employed  to  denote  the  index  of  the  left-most  point  in 
the  set  Cl  which  is  to  be  considered  as  a  constraint  on  the 
line  segment  being  determined.  LFTC2  is  similarly  employed 
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to  denote  the  left-most  point  on  C2.  Thus  with  respect  to 
Fig.  3  for  LL(1),  LFTC1=LFTC2=0  since  the  first  line  must 
be  constrained  by  the  points  C1(0)  and  C2(0)  and  all  points 
to  the  right  insofar  as  possible.  The  general  strategy  is 
to  index  K  and  to  determine  at  each  step  the  polygon  in  the 
a,P  space  which  bounds  the  region  in  which  acceptable  line 
parameters  are  to  be  found.  Thus  in  Fig.  3  the  intersection 
of  the  12  half-spaces,  given  by  a^+P^M>Cl(M)  and  ai+PiM5Lc2(M) 
for  M=0,l,2,3,4,5  are  found  to  define  a  non-null  convex  polygon 
while  the  intersection  of  that  polygon  with  the  half-space 
defined  by  . 6£C2 (6)  is  null.  This  allows  us  to  choose 
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parameters  for  LL(1).  In  particular  since  LL(1)  fails  to  meet 
the  constraint  for  a  point  on  the  upper  curve,  we  choose  for 
the  parameters  of  LL(1)  the  least  slope  feasible  point  in  the 
polygon  which  defines  the  feasible  region  for  all  points  from 
M=0  through  M=5,  i.e.  the  line  LL(1)  will  come  as  close  as 
possible  to  satisfying  C2(6).  MMIN  is  defined  to  be  the  index 
of  the  point  on  Cl  which  determines  the  minimum  slope.  In  the 
case  of  LL(1)  in  Fig.  3.  MMIN=3.  Similarly  MMAX  is  the  index 
of  the  point  on  C2  which  determines  the  maximum  allowable 
slope.  SMAX  is  intended  to  denote  that  point  on  the  boundary 
of  a  polygon  in  the  a,P  plane  for  which  the  slope  is  maximum 
while  SMIN  is  similarly  intended  as  the  point  of  minimum  slope. 
The  notation  L(SMAX)  or  L(SMIN)  refers  to  the  line  in  the  y,M 
plane  which  corresponds  to  the  point  SMAX  in  the  ct,  (3  plane. 

Thus  in  the  case  of  the  first  line  in  Fig.  1  LL(1)  is  choosen 
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as  L(SMIN)  for  the  appropriate  polygon.  The  variable  ZZ  is 
reevaluated  for  each  approximating  line  and  is  the  smallest 
value  of  M  which  need  be  considered  in  the  determination  of 
that  line,  i.e.  ZZ=MIN(LFTC1,CFTC2) .  For  example  for  the 
determination  of  LL(2)  in  Fig.  3,  LFTC1=3,  LFTC2=6,  and  ZZ=3. 
Thus  LL(2)  must  be  above  every  point  on  Cl  from  M=3  on  to  the 
right  as  far  as  possible  while  JjL(2)  need  be  below  points  on 
C2  from‘M=6  on  to  the  right.  The  variable  T  denotes  the  total 
number  of  points  for  which  Cl  and  C2  are  defined.  The  condition 
M=T  terminates  the  algorithm. 
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An  Algorithm  for  Calculating  the  Intersection  of  a 
Convex  Polygon  and  a  Linear  Half-Space 

The  intersection  of  a  linear  half-space  and  a  convex 
polygon  is  either  null  or  a  convex  polygon.  The  input  data  on 
which  the  algorithm  is  based  is  a  description  of  a  convex 
polygon  plus  a  description  of  the  half  space,  the  output  is 
either  a  description  of  a  convex  polygon  or  an  indication  that 
the  intersection  is  null.  Since  the  algorithm  is  to  be  used 
recursively  we  require  that  the  format  of  the  output  description 
be  the  same  as  that  of  the  input  description.  The  format  for  the 
specification  !  the  cnnve-  polygon  is  -n  ?r Jared  list  >f  the 
"corners"  of  the  polygon  (•/ ;,th  the  first  corner  repeated  as 
the  last  corner)  plus  an  indication  of  the  number  of  such 
corners.  The  notation  employed  for  the  input  polygon  in  the 
flow  diagram  of  Pig.  5  is  n  for  the  number  of  corners,  and 
Aq ,  A^ ,  . ..,  A  for  the  coordinates  of  the  corners  with  A^An. 

The  notation  for  the  resultant  polygon  is  Bn,  Bj.,  ...,  Bm, 
with  L  y=B  .  The  half-space,  S,  is  denoted  by  the  equation 
of  the  separating  line  L.  The  term  (A^eS?)  asks  whether  the 
point  Ai  is  an  element  of  the  half  space  defined  by  the 
boundary  L.  The  term  LN (A^ ,  A^-l)  denotes  the  line  segment 
joining  the  point  A^  to  the  point  A^-l  and  the  term 
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(Lf  >LN  ( A^  ,  A^~l))  denotes  the  point  of  intersection  of  the 
separation  line  L  and  the  segment  L  A^-l)  •  A  geometric 

illusti ation  of  the  input  and  output  polygons  is  given  in 
Pig.  6.  The  algorithm  itself  is  illustrated  in  the  flow 
diagram  of  Fig.  5.  The  algorithm  is  based  on  the  fact  that 
the  "corners"  of  the  resultant  polygon  will  be  the  corners 
of  the  input  polygon  that  are  in  the  half- space  plus  the  points 
of  intersection  of  the  separating  boundary  and  the  boundaries 
of  the  input  polygon. 
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FIGURE  5  -  FLOW  DIAGRAM  OF  POLYGON  INTERSECTION  ALGORITHM 
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A  Theorem  on  Tchebvsheff  Approximation 


Let  f(x)  be  a  continuous  real  valued  function  defined 
everywhere  on  the  non-negative  axis.  Let  g(x)  be  an  arbitrary 
polynomial  of  degree  n  and  let  G  be  the  set  of  all  such  nth 
degree  polynomials.  Define  R(f,g)  to  be  the  largest  value  of 
x  such  that  for  all  0<y<x,  j f (y) -g(y) | <6 .  If  | f (0)-g(0) |>6, 
then  R(f,g)  is  defined  to  be  zero.  Let  xR  be  defined  as 

xR=max  R(f ,g) , 
geG 

and  denote  by  gD  the  polynomial  which  maximizes  R,  i.e., 

xR=R(f,gR). 

Define  T(f,g)  to  be  the  maximum  value  of  x  such  that  for  all 
0£y<x,  |f(y)-g(y)  |£6  and  further  g(x)=f(x)+6.  Let  xT  be 
defined  as 


xT=max  T(f ,g) , 
geG 


and  denote  by  gT  the  polynomial  which  maximizes  T,  i.e., 

xT=T(f ,gT) . 

Define  B(f,g)  to  be  the  maximum  value  of  x  such  that 
for  all  0<y < x ,  |f(y)-g(y)  \<b,  and  further  g(x)=£(x)-6.  Let 

x0  be  defined  as 

x=max  B(f,g) 

^  geG 
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and  denote  by  gB  the  polynomial  which  maximizes  B,  i.e., 

XB=B(f,gB). 

Less  formally,  if  we  think  of  an  upper  boundary  given 
by  f+6  and  a  lower  boundary  given  by  f-6,  gR  is  the  polynomial 
which  stays  within  the  error  band,  i.e.,  the  interval  between 
f+6  and  f-6,  as  far  to  the  right  as  possible  and  xR  the  point 
at  which  it  leaves  the  error  band.  gT  is  the  polynomial  which 
stays  in  the  error  band  everywhere  from  zero  to  xT  at  which 
point  it  contacts  the  "top"  boundary.  xT  is  as  far  to  the 
right  as  possible.  gR  is  similarly  defined  except  that  at  the 
point  Xg  the  polynomial *gB  contacts  the  "bottom"  boundary.  The 
theorem  we  wish  to  prove  may  now  be  stated  as  follows: 

Theorem  1:  xR=maximum  (xT,x0) ,  and  further  gR=gT=gB. 

The  first  part  of  the  theorem,  i.e.,  xR=maximum 
(xT,Xg)  may  be  established  as  follows.  Assume  that  max 
(xT,xB)  =  xT.  It  cannot  be  the  case  that  xR<’xT  since 
R(f,gT)>xR  contrary  to  hypothesis.  Similarly  it  cannot  be  that 
xR>xT  since  in  this  case  either  T(f,gR)>xT  or  B(f,gR)>Xg  both 
contrary  to  hypothesis.  Since  it  cannot  be  the  case  that 
xR>xT  or  that  xR>x.r,  it  must  be  that  xR=xT.  A  similar 
argument  can  be  advanced  for  the  case  max  (xT,xB)=xB  by  simply 
reversing  the  roles  of  xT  and  Xg  in  the  above  argument. 

For  the  purpose  of  the  following  argument  let  us 
again  assume  that  xR=max(xT, Xg ) =xT.  Clearly  gR=9T  and  in  order 
to  complete  the  proof  of  the  theorem  we  must  show  that  gn=gD. 
Toward  this  end  we  employ  a  theorem  due  to  Tchebysheff  which 
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says  in  effect  that  the  quantity  |f(y)-gR(y) I  takes  on  its 
maximum  value  at  least  n  +  2  times  over  the  interval 

say  xi<'x2<^  *  *  *<n+l^xn+2=xR’  and  °f  the  error  at 

successive  points  alternates.  Thus  if  n  is  even  and  g(xR)= 

f(xD)+{>  we  have  that  g(x  . )  =f  (x . )  +6  for  j=2i,  l£i£7  +  1  and 
k  J  J 

similarly  g(Xj  )  =f  (x^  ) -6  for  j=2i-l,  l£.i£.2  +  1-  We  will  have 

our  desired  result  if  we  demonstrate  that  xn+i=xB*  us 

assume  that  it  is  not  the  case  that  Xg=xn+;L  but  that  xr+^ 

<'x0<'xR=xT.  Since  xn4l  is  the  rightmost  point  at  which 

gR=f-6  it  follows  that  gR(xR) >gR (Xg) .  We  note  that  gR  is  a 

nth  degree  polynomial  determined  by  the  n+l  points 

(x1?f  (x^-6)  ,  (x2,f  (x2) +6)  , . . .  ,  (xn,f  (xn)+6)  ,  (xn+1,f  (xn+i)_6)  * 

Similarly  gfi  is  an  nth  degree  polynomial  determined  by 

(xx,f  (x1)-6-t/'1)  ,  (x2,f  (x2) +^-Ao)  , .  •  •  ,  Un,£(xn)+?>-6n)  , 

(x  j  -  m  f  ( x  .,)-*+A  )  in  which  Aj>0.  Consider  the  nth  degree 

n+i 7  n+i  n+i 

polynomial  h(x)  defined  by  the  points  (x^ , A^) , (x2 , -^2) » • • • j 
(x  ,-A  ) (x  ,,,A  , . ) .  It  must  be  true  that  g  =g_+h  since 
they  are  all  nth  degree  polynomials  and  gR  coincides  with 
g_+h  at  n+l  points.  We  note  that  h(x)  changes  sign  between 
every  pair  of  points  so  that  all  of  the  roots  of  h(x)  are 
accounted  for  and  h(x)  must  be  positive  for  all  r>’tn+2>*  '^e 

have  that  h(xB)>0  or  that  gB  (Xg)  ^(Xg) .  Thus  we  have  a 
contradiction  resulting  from  the  assumption  that  xg>xn+]_* 

Since  B(f,gR)=xn+1  it  cannot  be  that  xR<xn+1  and  therefore 
Xg=xn+1  and  gB=gR.  Similar  arguments  can  be  advanced  for 
the  case  of  n  odd  and  also  for  the  case  in  which  xR=Xg . 
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